fix: parse XML thinking and tool_call blocks in OpenRouter responses #6634

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Closed

roomote wants to merge 1 commit into main from fix/qwen3-xml-parsing-openrouter

Contributor

roomote bot commented Aug 3, 2025 •

edited by ellipsis-dev bot

Loading

This PR fixes the issue where Qwen3 Coder model through OpenRouter sends thinking and tool call blocks as raw XML text instead of being properly parsed and displayed in their respective UI panes.

Problem

When using the qwen/qwen3-coder:free model through OpenRouter, the response contains <think>...</think> and <tool_call>...</tool_call> blocks as raw text in the content, which are displayed directly to the user instead of being parsed and shown in the appropriate UI sections.

Solution

Added XML parsing logic to the OpenRouter handler to detect and parse these XML blocks
Implemented a buffering mechanism to handle incomplete XML blocks across streaming chunks
Convert <think> blocks to reasoning chunks that display in the thinking pane
Convert <tool_call> blocks to a user-friendly format with [Tool Call]: prefix
Added comprehensive tests to ensure the parsing works correctly in various scenarios

Testing

Added 4 new test cases covering:
- Basic <think> block parsing
- Basic <tool_call> block parsing
- Multiple and nested XML blocks
- Incomplete XML blocks split across streaming chunks
All existing tests continue to pass

Fixes #6630

Important

Fixes XML parsing for <think> and <tool_call> blocks in OpenRouter responses, adding tests for various scenarios.

Behavior:
- Parses <think> and <tool_call> XML blocks in OpenRouterHandler in openrouter.ts.
- Converts <think> blocks to reasoning chunks and <tool_call> blocks to text with [Tool Call]: prefix.
- Handles incomplete XML blocks across streaming chunks.
Testing:
- Adds tests in openrouter.spec.ts for parsing <think> and <tool_call> blocks, nested/multiple XML blocks, and incomplete XML blocks.
- Ensures all existing tests pass.

^{This description was created by}^{for aa8cb2f. You can customize this summary. It will automatically update as commits are pushed.}


          fix: parse XML thinking and tool_call blocks in OpenRouter responses

aa8cb2f

- Add XML parsing for <think> and <tool_call> blocks in OpenRouter handler
- Handle incomplete XML blocks across streaming chunks
- Convert tool_call blocks to user-friendly format
- Add comprehensive tests for XML parsing functionality

Fixes #6630

roomote bot requested review from cte, jr and mrubens as code owners

August 3, 2025 21:05

github-project-automation bot added this to Roo Code Roadmap and Roo Code Roadmap

github-project-automation bot moved this to Triage in Roo Code Roadmap

github-project-automation bot moved this to New in Roo Code Roadmap

dosubot bot added size:L bug labels

ellipsis-dev bot reviewed

View reviewed changes

src/api/providers/openrouter.ts

    
              import { addCacheBreakpoints as addGeminiCacheBreakpoints } from "../transform/caching/gemini"

              import type { OpenRouterReasoningParams } from "../transform/reasoning"

              import { getModelParams } from "../transform/model-params"

              import { XmlMatcher } from "../../utils/xml-matcher"

Contributor

ellipsis-dev bot Aug 3, 2025

Remove unused import 'XmlMatcher' if it's not needed to avoid confusion.

Suggested change

import { XmlMatcher } from "../../utils/xml-matcher"

^{This comment was generated because it violated a code review rule: irule_Vw7dJWzvznOJagxS.}

src/api/providers/openrouter.ts

    
              				yield { type: "text", text: delta.content }

              				buffer += delta.content

              				// Process complete XML blocks

Contributor

ellipsis-dev bot Aug 3, 2025

Consider refactoring the inline XML parsing logic (lines 158–216) into a shared utility (or use the imported XmlMatcher) for improved readability and maintainability.

^{This comment was generated because it violated a code review rule: irule_tTqpIuNs8DV0QFGj.}

roomote bot mentioned this pull request

Qwen3 Coder diplays raw text for thinking and tool calls when using OpenRouter #6630

Closed

roomote bot commented

View reviewed changes

Contributor Author

roomote bot left a comment

I wrote XML parsing code when XmlMatcher was right there. Peak efficiency.

src/api/providers/openrouter.ts

    
              import { addCacheBreakpoints as addGeminiCacheBreakpoints } from "../transform/caching/gemini"

              import type { OpenRouterReasoningParams } from "../transform/reasoning"

              import { getModelParams } from "../transform/model-params"

              import { XmlMatcher } from "../../utils/xml-matcher"

Contributor Author

roomote bot Aug 3, 2025

I notice we're importing XmlMatcher but not using it. Since we've implemented custom XML parsing logic below, should we remove this unused import? Or perhaps we could consider using the existing XmlMatcher utility instead of the manual implementation?

src/api/providers/openrouter.ts

    
              					processed = false

              					// Check for complete <think> blocks

              					const thinkMatch = buffer.match(/^(.*?)<think>([\s\S]*?)<\/think>(.*)$/s)

Contributor Author

roomote bot Aug 3, 2025

The regex patterns assume well-formed XML. What happens if the model sends malformed XML like <think>content without closing tag or nested tags? The current implementation might not handle these edge cases gracefully. Should we add some validation or fallback behavior?

src/api/providers/openrouter.ts

    
              				// Process complete XML blocks

              				let processed = true

              				while (processed) {

Contributor Author

roomote bot Aug 3, 2025

For large responses with many XML blocks, this while loop with multiple regex matches could impact performance. Have we considered the performance implications? Perhaps we could optimize by combining the regex patterns or using a different parsing approach?

src/api/providers/openrouter.ts

    
              					}

              					// Check for complete <tool_call> blocks

              					const toolMatch = buffer.match(/^(.*?)<tool_call>([\s\S]*?)<\/tool_call>(.*)$/s)

Contributor Author

roomote bot Aug 3, 2025

The logic for handling <think> and <tool_call> blocks is nearly identical. Could we refactor this into a helper function to reduce duplication? Something like:

Suggested change

      
            					const toolMatch = buffer.match(/^(.*?)<tool_call>([\s\S]*?)<\/tool_call>(.*)$/s)
          
            // Helper function to process XML blocks
          
            const processXmlBlock = (buffer: string, tagName: string, transform?: (content: string) => any) => {
          
              const regex = new RegExp(`^(.*?)<${tagName}>([\s\S]*?)<\/${tagName}>(.*)$`, 's');
          
              const match = buffer.match(regex);
          
              if (match) {
          
                const [, before, content, after] = match;
          
                return { matched: true, before, content: transform ? transform(content) : content, after };
          
              }
          
              return { matched: false };
          
            };

src/api/providers/openrouter.ts

    
              					}

              					// Check if we have an incomplete tag at the end

              					const incompleteTag = buffer.match(/^(.*?)(<(?:think|tool_call)[^>]*(?:>[\s\S]*)?)?$/s)

Contributor Author

roomote bot Aug 3, 2025

This buffer management logic for incomplete tags is complex. Could we add some inline comments explaining the different scenarios? For example:

When we have an incomplete tag at the end
When we need to preserve partial tag content
When it's safe to yield all content

This would improve maintainability for future developers (including myself in 5 minutes).

hannesrudolph added the Issue/PR - Triage label

vertigo235 commented Aug 4, 2025

Isn't this also an issue when using ollama?

Member

daniel-lxs commented Aug 4, 2025

This is not a proper fix, the model is outputting the tag <think> and hallucinating the tag <tool_call/>, closing as this is an issue with the model itself

daniel-lxs closed this

github-project-automation bot moved this from New to Done in Roo Code Roadmap

github-project-automation bot moved this from Triage to Done in Roo Code Roadmap

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Reviewers

ellipsis-dev[bot] ellipsis-dev[bot] left review comments

mrubens Awaiting requested review from mrubens mrubens is a code owner

cte Awaiting requested review from cte cte is a code owner

jr Awaiting requested review from jr jr is a code owner

Labels

bug Issue/PR - Triage size:L